SeQual: an unsupervised feature selection method for cloud workload traces
نویسندگان
چکیده
Abstract One challenge of studying cloud workload traces is the lack available users’ identities. Therefore, clustering methods were used to address this through extracting these identities from traces. For better extraction, it beneficial select attributes (columns in traces) for by using feature selection methods. However, use general requires details that are not (e.g. predefined number clusters). paper, we present an unsupervised method identify good candidate clustering. This uses Silhouette coefficients rank best extraction The performance our SeQual evaluated comparison with commonly (supervised and unsupervised) help quality metrics (i.e. adjusted rand index, entropy precision). results show can compete supervised perform than ones, average accuracy between 90% 99%.
منابع مشابه
Feature Selection for Unsupervised Learning
In this paper, we identify two issues involved in developing an automated feature subset selection algorithm for unlabeled data: the need for finding the number of clusters in conjunction with feature selection, and the need for normalizing the bias of feature selection criteria with respect to dimension. We explore the feature selection problem and these issues through FSSEM (Feature Subset Se...
متن کاملHierarchical fuzzy filter method for unsupervised feature selection
The problem of feature selection has long been an active research topic within statistics and pattern recognition. So far, most methods of feature selection focus on supervised data where class information is available. For unsupervised data, the related methods of feature selection are few. The presented article demonstrates a way of unsupervised feature selection, which is a two-level filter ...
متن کاملEmbedded Unsupervised Feature Selection
Sparse learning has been proven to be a powerful technique in supervised feature selection, which allows to embed feature selection into the classification (or regression) problem. In recent years, increasing attention has been on applying spare learning in unsupervised feature selection. Due to the lack of label information, the vast majority of these algorithms usually generate cluster labels...
متن کاملUnsupervised Personalized Feature Selection
Feature selection is effective in preparing high-dimensional data for a variety of learning tasks such as classification, clustering and anomaly detection. A vast majority of existing feature selection methods assume that all instances share some common patterns manifested in a subset of shared features. However, this assumption is not necessarily true in many domains where data instances could...
متن کاملRobust Unsupervised Feature Selection
A new unsupervised feature selection method, i.e., Robust Unsupervised Feature Selection (RUFS), is proposed. Unlike traditional unsupervised feature selection methods, pseudo cluster labels are learned via local learning regularized robust nonnegative matrix factorization. During the label learning process, feature selection is performed simultaneously by robust joint l2,1 norms minimization. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of Supercomputing
سال: 2023
ISSN: ['0920-8542', '1573-0484']
DOI: https://doi.org/10.1007/s11227-023-05163-w